Development of Application-level Fault Tolerance in a Real-time Benchmark
نویسندگان
چکیده
As multiprocessor systems become more complex their reliability will need to increase as well In this paper we propose a novel technique which is applicable to a wide variety of distributed real time systems especially those exhibiting data parallelism We assert that for high reliability a combination of system level fault tolerance and application level fault tolerance works best In many systems application level fault tolerance can be used to bridge the gap when system level fault tolerance alone does not provide the required reliability We exemplify this with the RTHT target tracking benchmark
منابع مشابه
Fault Tolerant Supercomputing: A Software Approach
Adding fault tolerance to embedded supercomputing applications is becoming an issue of great significance, especially as these applications support critical parts of our everyday life in the modern “Information Society”. To this end, a software middleware framework is presented that features a collection of flexible and reusable fault tolerance modules acting at different levels and coping with...
متن کاملDeveloping dependable real-time systems
A growing number of safety-critical systems is controlled by computer systems. In the context of several research projects solutions were suggested how to reduce the implementation effort for dependable real-time systems. Unfortunately most of these approaches are based on special hardware solutions or restricted to specific application domains. In addition most of the application realize only ...
متن کاملOn the development of a sliding mode observer-based fault diagnosis scheme for a wind turbine benchmark model
This paper addresses the design of an observer-based fault diagnosis scheme, which is applied to some of the sensors and actuators of a wind turbine benchmark model. The methodology is based on a modified sliding mode observer (SMO) that allows accurate reconstruction of multiple sensor or actuator faults occurring simultaneously. The faults are reconstructed using the equivalent output err...
متن کاملOn the development of a sliding mode observer-based fault diagnosis scheme for a wind turbine benchmark model
This paper addresses the design of an observer-based fault diagnosis scheme, which is applied to some of the sensors and actuators of a wind turbine benchmark model. The methodology is based on a modified sliding mode observer (SMO) that allows accurate reconstruction of multiple sensor or actuator faults occurring simultaneously. The faults are reconstructed using the equivalent output err...
متن کاملA Heuristic Checkpoint Placement Algorithm for Adaptive Application-Level Checkpointing
Checkpoint/rollback is an effective scheme for fault tolerance and has been widely used to reduce the overall execution time of long-running applications in case of faults. The locations of checkpoints in application programs are critical since the distance between two consecutive ones determines the checkpointing scheme’s sensitivity and overheads. If they are too far apart, applications might...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998